Picture for Kai Yang

Kai Yang

Sherman

RLVR Datasets and Where to Find Them: Tracing Data Lineage for Better Training Data

Add code
May 26, 2026
Viaarxiv icon

FD-RAG: Federated Dual-System Retrieval-Augmented Generation

Add code
May 22, 2026
Viaarxiv icon

SafeAlign-VLA: A Negative-Enhanced Safe Alignment Framework for Risk-Aware Autonomous Driving

Add code
May 19, 2026
Viaarxiv icon

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Add code
May 13, 2026
Viaarxiv icon

Debiased Model-based Representations for Sample-efficient Continuous Control

Add code
May 12, 2026
Viaarxiv icon

C-CoT: Counterfactual Chain-of-Thought with Vision-Language Models for Safe Autonomous Driving

Add code
May 11, 2026
Viaarxiv icon

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Add code
May 07, 2026
Viaarxiv icon

AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation

Add code
Apr 27, 2026
Viaarxiv icon

When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compression

Add code
Apr 04, 2026
Viaarxiv icon

A Joint Graph-Cut Channel Estimation Method for Multi-user Holographic MIMO

Add code
Mar 18, 2026
Viaarxiv icon